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HI galaxy simulations 


1. Introduction 

Large scale structure surveys have so far relied on imaging of a large number of galaxies at op¬ 
tical or near-infrared wavelengths combined with redshift information to obtain the 3-dimensional 
galaxy distribution (e.g. DBS, BOSS, Euclid). Hydrogen is the most abundant baryon in the Uni¬ 
verse, making it a prime candidate for a tracer of the underlying dark matter distribution and an 
invaluable probe of the energy content of the Universe and the evolution of large-scale structure. 
In order to use hydrogen as a tracer at radio wavelengths, we need to rely on radio observations 
of the neutral hydrogen (HI) 21cm line. With a rest frequency of 1420 MHz, telescopes probing 
the sky between this and 250 MHz would be able to detect galaxies up to redshift 5. Although 
we expect most galaxies to contain HI, since the spin-flip transition responsible for the 21cm sig¬ 
nal is highly forbidden, the emission is quite weak: at z = 1.5 most galaxies with a HI mass of 
10^ M© will be observed with a flux density of ~ 1 /rJy. This implies that for low sensitivity 
surveys, only the closest or Hl-richest galaxies will be observed, and therefore massive telescopes 
will be required in order to detect sufficient numbers of HI galaxies at a level capable of providing 
competitive constraints in Cosmology. This chapter analyses the expected number density of HI 
galaxies as a function of redshift as well as its clustering bias taking into account the flux sensi¬ 
tivities for HI galaxy surveys using the planned SKA telescopes. We discuss the process by which 
we obtain these numbers - using a set of state of the art simulations - identify inherent limitations 
in the current set up and suggest a series of improvements. Although we focus on SKAl and 2, 
the sensitivity calculations presented here allow to easily consider other configurations such as an 
early deployment of SKAl (corresponding to about 50% of its expected sensitivity). 


2. Sensitivity calculations 


The noise associated to the flux density (flux per unit frequency) measured by an interferometer 
can be assumed to be Gaussian with a rms given approximately by 


Os 


'^-ksTsys 
^eff\/25v tp 


( 2 . 1 ) 


for an array with total effective collecting area Aeff, frequency resolution 5v and observation time 
per pointing tp {kg is the Boltzmann constant). The extra factor of 1/ s/l comes from assuming dual 
polarised systems. Telescope sensitivities are sometimes quoted in terms of the "System Equivalent 
Elux Density": SEED = or just "A over T": Aeft/Tsyg. The effective collecting area 

depends on the efficiency of the system which is expected to be the in the range of 70 to 80% for 
the SKAl dishes. 

Note that the expression above gives the flux density sensitivity per resolution beam (not to be 
confused with the dish primary beam or telescope field of view). Moreover, this is the sensitivity 
if the full array is used without constraints on the psf (point spread function or resolution beam). 
This is what is usually called the natural array sensitivity. Although angular resolution is not a 
crucial issue in a HI galaxy survey, some weighting of the visibilities might be required in order to 
obtain a better behaved psf. In that case, the sensitivity of the telescopes will be further reduced. 
The required shape and resolution of the psf will depend on the source detection algorithm for 
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HI galaxies which is still a subject of active research. For SKAl, using uniform weighting plus 
Gaussian tapering of the visibilities can produce "well behaved" Gaussian beams at 10 arc-seconds 
resolution with flux sensitivities that are 30% to 80% worse than what is quoted here (see Braun 
2013). However, new source extraction algorithms might be able to deal with more complex psfs 
(for instance, by working in the visibility space directly) and approach the natural array sensitivity. 
Therefore in this chapter we decided to quote values using the full array, underlining that in reality 
the situation might be less optimistic. 

The equivalent brightness temperature uncertainty is 

where 5 0 is the angular resolution of the interferometer. The total temperature is 


2sys — Tinst + ^sky 


(2.3) 


with Tsky ~ 60 K and Tinst the instrument temperature, which is usually higher than 

the sky temperature above 300 MHz. For typical instrument specifications, the array noise flux rms 
can be written as: 


Os = 260/rJy 



p5,000m2\ 
V Aeff ) 
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(2.4) 


For a given survey area, Sarea we will need approximately Sarea/(0B)^ pointings where (0 b)^ 
is the telescope primary beam (instantaneous field of view) with the full width at half maximum of 
the beam, given by (in radians) 

eB~1.22A/v^, (2.5) 

where Adish is the effective area of each dish. The time per pointing tp is then related to the total 
integration time ttot through 

tp — hot „ • (2-6) 

*2 area 

This will increase the time per pointing at the lowest frequencies (for a fixed ftot)- Note however 
that when doing a survey some overlap of the beams is expected in order to achieve uniformity on 
the noise across the sky map. Following the SKAl imaging performance memo, we assume that 
the mosaicked beams will correspond to a size of: 


oi 


K /1.3Ay 
8 


[sr]. 


(2.7) 


The situation is slightly more complicated for PAFs (Phased Array Feeds) as below a certain critical 
frequency (usually taken as the middle of the band), the beams will remain constant. Therefore, for 
a PAF with a certain number of beams, Nb, we will assume that above the critical frequency, the 
full beam will be Nb x SPF, where SPF is the single pixel feed beam as above (eq. and below 
that critical frequency, it will remain constant. 

Following the current baseline design (Dewdney et al. 2013), table [I] summarises the spec¬ 
ifications for each telescope, while table ^ contains the specifications assumed for the target HI 
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Telescope 

Band [MHz] 

z 

Tinst [K] 

Ndish 

^dish [l^] 

Aeff [m^j 

beam [deg^j “ 

Flux rms [qJy] ’’ 

SKAl-MID 

350 - 1050 

0.35 - 3.06 

28 

190 

15 

21,824 

1.78 

417 

SKAl-MID 

950 - 1760 

(0) - 0.50 

20 

190 

15 

26,189 

0.48 

247 

MeerKAT 

580- 1015 

0.40- 1.45 

29 

64 

13.5 

6,413 

1.68 

1015 

MeerKAT 

900 - 1670 

(0) - 0.58 

20 

64 

13.5 

5,955 

0.64 

1093 

MID H- MK 

580- 1015 

0.40- 1.45 

28.5 

254 

- 

28,237 

1.37 

328 

MID H- MK 

950 - 1670 

(0) - 0.50 

20 

254 

- 

32,144 

0.51 

202 

SKAl-SUR 

350 - 900 

0.58 - 3.06 

50 

60 

15.0 

8,482 

61 (710) 

1918 

SKAl-SUR 

650- 1670 

(0)- 1.19 

30 

60 

15.0 

8,482 

18 (1300) 

1151 

ASKAP 

700- 1800 

(0)- 1.03 

50 

36 

12.0 

3,257 

30(1250)'^ 

4996 

SUR H- ASKAP 

650 - 1670 

(0)- 1.19 

30*^ 

96 

- 

11,740 

18 (1300)'' 

832 

SKA2S 

480 - 1290 

0.1 -2.0 

20 

250’’ 

50 

400,000 

30 

16 


Table 1: Telescope configurations. 

Notes: “ This is the primary beam (FoV) calculated at the center of the band unless stated otherwise. It is 
proportional to (except for PAFs). For the combined telescopes, the smallest beam of the two telescopes 
is used. Flux density rms at the centre of the band (or the frequency shown in parenthesis) for a frequency 
interval of 0.01 MHz and 1 hour integration using Eq. ^ . This is the natural array sensitivity. Requirements 
on the shape of the psf can increase this value by about 50%. Only 500 MHz instantaneous bandwidth. 

PAF beams assumed constant below the critical frequency (shown in parenthesis in MHz) and going as 
1/v^ above that. ® Assuming that all ASKAPPAFs will be replaced to meet the SKAl-SUR frequency band 
and instrument temperature of 30K. ^ Assuming that only band 2 will be initially deployed. ® Values only 
indicative - can be changed. Both beam and fiux rms are assumed constant across the band. ** These should 
be stations (dense aperture arrays). 


galaxy surveys. Note that, in order for the sensitivities to be proportional to v, we assume that 
the mosaicking is done at the highest frequency used for the HI survey (i.e. telescope pointings 
are packed side by side at the highest frequency). This way, at lower frequencies, the beams will 
overlap, since their size varies as 1/v^, which will increase the time per pointing with the cor¬ 
responding improvement in sensitivity. Of course, for PAFs, this sensitivity will remain constant 
below the critical frequency. Table ^ shows that both SKAl instruments (MeerKAT-i-MID and 
ASKAP-i-SUR) have comparable performance for a HI galaxy survey at the highest frequency band 
considered here (band 2). Note however that this assumes that all ASKAP PAFs will have to be 
replaced with new generation receivers at some point. On the other hand, at the lowest frequencies 
(band 1), SKAl-SUR seems to perform better. 


3. HI galaxy surveys 

To calculate the HI galaxy number density and bias as a function of flux rms, we used the SAX- 
sky simulation^. The SAX-Sky simulation consists of a galaxy catalogue containing the position 
and several astrophysical properties for each object in a mock observing cone. It was produced by 
Obreschkow et al. (2009) by adding HI and CO properties to the galaxies obtained by De Lucia 

'http://s-cubed.physics.ox.ac.uk/s3_sax 
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Telescope 

Band [MHz] 

beam [deg^j 

Survey area [deg^j 

tp [hours] 

[fiJy] 

SKAl-MID^ (Band 1) 

350 - 1050 

0.88 

5,000 

1.76 

315 

SKAl-MID** (Band 2) 

950 - 1760 

0.88 

5,000 

1.76 

187 

MeerKAT “ (Band 1) 

580- 1015 

1.06 

5,000 

2.13 

696 

MeerKAT “ (Band 2) 

900 - 1670 

1.06 

5,000 

2.13 

750 

SKA 1-MID H- MeerKAT ** (Band 1) 

580- 1015 

0.88 

5,000 

■Q 

247 

SKA 1-MID H- MeerKAT ** (Band 2) 

950 - 1670 

0.88 

5,000 


152 

SKAl-SUR’’ (Band 1) 

350 - 900 

61 (710) 

5,000 

122 


SKAl-SUR’’ (Band 2) 

650 - 1670 

18 (1300) 

5,000 

36 

■■ 

ASKAP 

700- 1800 

30 (1250) 

5,000 

6 

645 

SKAl-SUR H-ASKAP 

700 - 1670 

18(1300) 

5,000 

36 

140 

SKA2‘= 

480 - 1290 

30 

30,000 

10 

5.14 


Table 2: Survey specifications. We assume a total observation time of 10,000 hours. The corresponding flux 
density rms (5rms ) is calculated for a frequency interval of 0.01 MHz using the natural array sensitivity. 
Notes: ^ Values calculated at the target frequency of 1.0 GHz unless stated otherwise. The beam and time 
per pointing (fp) are assumed to change as ^^j-oss the band. The flux density rms is assumed to 

change as ^ across the band. ** Values calculated at the PAF critical frequency (in parentheses, in MHz). 
Below that frequency, values are assumed constant. Above it, the beam and tp are assumed to go as 1 /v^, 
and the flux density rms as v. Indicative values. The beam and flux density rms are assumed constant 
across the band. 


& Blaizot (2007) through the post processing of the Millennium dark matter simulation (Springel 
et al. 2005). 

The Millennium simulation is a large dark matter simulation with a box size of 500 h^^ Mpe 
and a spatial resolution of 5 h^^ Kpc. It was run in order to study the formation of dark matter 
halos and their evolution with time and therefore comoving snapshots are available at 64 fixed 
time steps from redshifts 127 to 0. The simulation was run using a WM API-compatible cosmol¬ 
ogy and a mass resolution of 8.6 x lO^/i^^M©. The halos were identified as Friends-of-Friends 
groups containing more than 20 particles, and merger trees were built to link each halo with its 
substructures throughout time. In De Lucia & Blaizot (2007) the simulation was post-processed 
in order to populate the halos with simple models of galaxies and used a semi-analytical scheme 
to evolve the galaxies properties independently unless a merger occurs. The properties of each of 
these galaxies includes the mass of the different constituents, such as the mass of the cold and hot 
gas components, as well as the galaxy star formation rate (SFR), Hubble type, etc. 

The atomic and molecular gas components of each galaxy were then obtained by Obreschkow 
et al. (2009) from the cold gas component based on properties of nearby regular spiral galaxies. 
Assuming that the cold gas of these galaxies resides in flat, approximately symmetric disks, it 
follows that the local gas pressure is connected to the molecular gas fraction by a known physically 
based prescription, which is also compatible with observations (at least for the more HI massive 
galaxies). The hydrogen mass of the cold gas component was assumed to be 0.74% of the total 
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mass. In order to properly emulate the light cone, the SAX-sky simulation used only part of each 
snapshot of the Millennium simulation as is described in Obreschkow et al. (2009). Therefore, the 
boxes at fixed redshifts with HI properties which can be obtained from the SAX-sky simulation are 
considerably smaller than 500Mpc along the line of sight. This can undermine the estimation 
of statistical quantities such as the galaxy bias (the number densities, on the other hand, can be 
easily obtained from the queries in the s-cubed web site). 

3.1 Galaxy number counts 

For the detection of a galaxy, we required that at least two points on the HI line are measured, 
that is, the width of the line has to be larger than twice the assumed frequency resolution of the 
survey. The idea is to obtain information on the typical line double peak expected from HI galaxies 
due to their rotation. This in principle will remove any galaxy that is seen “face on” since it would 
show just a narrow peak. More evolved source detection methods can (and should) be explored, to 
avoid spurious detections due to RFI, but we will keep this simple approach for now. A signal to 
noise of 10 is then usually required for the detection of a galaxy. We used the following variables 
and prescription to detect HI galaxies in the SAX database: 

• z apparent - Apparent redshift (including Doppler correction). 

• Experiment spectral resolution set to dV = 2.1(1 + zapparent) at the galaxy rest frame 
(in Km/s). This corresponds to an observed frequency resolution of 0.01 MHz which is what 
has been assumed for the sensitivity calculations. 

• hiwidthpeak [Km/s] - Line width between the two horns of the Hl-line profile in fhe 
galaxy resf frame (already correcfed for fhe galaxy inclination). 

• Following whaf has been discussed, we fake only galaxies where hiwidthpeak/2 > dV. 

• hi int flux [Jy Km/s] - Velocify-infegrafed line flux of fhe Hl-line (in fhe observers frame). 

• For each galaxy fhe observed sensifivify in terms of flux densify (flux per frequency) is fhen 
fluxd = hiintf lux/hiwidthpeak. Note fhaf fhis already factors in fhe relation befween 
velocify infegrafed flux and fhe frequency infegrafed flux as well as fhe relation befween fhe 
velocify line widfh in fhe resf frame and fhe corresponding observed frequency. 

• Finally, we lake only galaxies where flux > Acut^rms/ s/ (hiwidthpeak/r/F) (we look 
Acut = 10 for a "10-sigma" cul). 

In order to be as general as possible we did nol fry to malch completely fhe flux rms used in 
fhe query lo fhe survey specifications given in fable Instead we give resulls for differenl values 
so lhal a simple interpolation can be used for differenl surveys. In parlicular, if we decide fo use 
a 5a cul instead of 10a, if is jusf a question of looking for fhe numbers corresponding to fhe flux 
rms fhaf is half fhe survey value (since fhe oblained number counfs all assumed a 10-sigma cuf). 
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Figure 1: The dimensionless HI galaxy power spectrum for different flux cuts using the prescription de¬ 
scribed above for each flux rms. Black solid line shows the dark matter power spectrum from CAMB. The 
straight lines at high k indicate shot noise. 


3.2 Galaxy bias 


To obtain the bias, the “detected” galaxies were put in a box, for which the power spectrum 
of the number counts was calculated. The bias squared was then taken as the ratio of that power 
spectrum to the dark matter one at k = 0.2 h/Mpc. To perform this study we used boxes with 
different sizes from L|| = 58 /j^^ Mpc and Lj_ = 100/j^^ Mpc for redshift 0.06 to L|| = \62h^^ 
Mpc and = 398/z^^ Mpc for redshift 2.07. This makes the bias extraction problematic, since 
we cannot efficiently probe modes below ~ 0.1 h/Mpc, where we can safely neglect non-linear 
effects. For high flux rms, the number of galaxies is low and the shot noise dominates up to very 
small wave numbers. The conclusion is that for high flux/redshift values the results should be taken 
as only an indication, in particular for bias above ~ 20/iJy flux rms. As an example we show in 


Figure |3JJ the dimensionless HI galaxy power spectrum at z = 1 for different sensitivities. Also 
note that the Millennium simulation only has galaxies with masses > 2 x and therefore the 

bias and number density for rms=l might be affected by the lack of smaller galaxies. However this 
should not affect the statistics for higher rms values. The cosmological analysis should compare 
results for different fiducial values and fully marginalise over the bias and number counts. 


4. Fitting functions for galaxy number counts and bias 

In this section we describe fitting functions for the galaxy number counts, dn/dz, and bias, 
b{z), as a function of redshift. At a constant flux rms, these are well described by the functions 
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(Yahya et al. 2014) 


dn/dz = 10^‘z‘^^exp(—C 3 z) (4.1) 

b{z) = C 4 exp(c 5 z), (4.2) 


where c,- are free parameters, dn/dz is the number of galaxies per square degree, per unit redshift. 
Best-fit values of the parameters are given in Table and Fig. ^ compares the fitted curves with 
data points from the simulations. 

Note that for a redshift bin of width Az, the observed number of galaxies in a 1 deg^ patch is 
and the comoving galaxy number density is n{z) = ^ dv)dz ’ "'here ^ = ^(7r/180)^r(z)^, 
r(z) is the coming distance to redshift z and dr/dz = c/H{z)- 

As explained above, the flux sensitivities for the telescopes listed in Table ^ are taken to ei¬ 
ther evolve with frequency, S'rms or are assumed constant across the band. For the non-PAF 
receivers (SKA 1-MID and MeerKAT), the flux rms as a function of redshift is 


c l^\ — cflef) 

1,41^ — *->rms 


A^cut\ V21 ,, ^ 

10 j 1 GHz^ 


(4.3) 


where Srm^ is the reference flux rms at 1 GHz listed in Table Ncut is the detection threshold in 
multiples of the flux rms, and V 21 ^ 1.42 GHz is the rest frame frequency of the HI line. For the 
PAF receivers (SKAl-SUR and ASKAP), the flux rms is 


,(z)=5Sf feix 


1 V < Vcrit 

^(l+z)-l v> Vcrit 


(4.4) 


where Vcrit is the PAF critical frequency listed in Table ^ and is the reference flux at the 
critical frequency. The full SKA has Srmslz) = const. We take Ncut = 5 for all but the full SKA, to 
which we apply a more stringent 10a threshold. To obtain the number counts and bias for a given 
telescope, we interpolate Eqs. ^ and as a function of flux rms using the values given in Table 
^ and evaluate them as a function of redshift using the appropriate Srms (z) function. We then fit the 
same fitting functions to the results - best-fit parameters are given in Table ^ Note that while the 


frequency-corrected number counts are all satisfactorily fitted by Eq. |4.1| , four of the bias functions 
are not well described by Eq. |4.2^ these are flagged in the table. 


5. Limitations of current simulations and future improvements 

Although the SAX-Sky is a state-of-the-art simulation in terms of cosmological radio surveys, 
it suffers from a number of shortcomings that would be desirable to address in the future, in order 
to improve the quality of the forecasts for the SKA. 

Probably the most challenging problem for this type of simulations is finding a compromise 
between the enormous volume that experiments like the SKA are expected to probe, and the com¬ 
plexity necessary to simulate the type of objects that will be observed with sufficient precision. 
In our case, we would ideally need a simulation with a box of size Lbox ~ 15/j^^Gpc and resolv¬ 
ing haloes down to ~ which is simply out of the question given the present typical 
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red shift (z) 



Figure 2: Upper panel; Dependence of the HI galaxy redshift distribution dn/dz (units: deg^^). Note that 
the numbers are for different flux rms which will in turn correspond to a given galaxy flux cut according to 
the source detection procedure described in the text (which assumed a 10-sigma cut). Curves follow the fits 
according to Eq. f.l and dots are from the S^-SAX simulation. Lower Panel; HI galaxy bias for different 
Srms- Note that above 70 /rJy values for high redshifts are purely extrapolations. However, this has little 
impact as at high z, shot noise will dominate for these sensitivities. 
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^rms [/^Jy] 

Cl 

C2 

C3 

C4 

C5 

0 

6.23 

1.82 

0.98 

0.8695 

0.2338 

1 

7.33 

3.02 

5.34 

0.5863 

0.6410 

3 

6.91 

2.38 

5.84 

0.4780 

0.9181 

5 

6.77 

2.17 

6.63 

0.5884 

0.8076 

6 

6.84 

2.23 

7.13 

0.5908 

0.8455 

7.3 

6.76 

2.14 

7.36 

0.5088 

1.0222 

10 

6.64 

2.01 

7.95 

0.4489 

1.2069 

23 

6.02 

1.43 

9.03 

0.5751 

0.9993 

40 

5.74 

1.22 

10.58 

0.5125 

1.1842 

70 

5.62 

1.11 

13.03 

0.6193 

1.0179 

100 

5.63 

1.41 

15.49 

0.6212 

1.0759 

150 

5.48 

1.33 

16.62 

- 

- 

200 

5.00 

1.04 

17.52 

- 

- 


Table 3: Values of the fitted parameters for Eqs. 4.1 and 4.2. For higher flux cuts we suggest taking b{z)=l, 
as only galaxies below z ^ 0.5 are relevant anyway. 


Telescope 

Cl 

Cl 

C3 

C4 

C5 

A^gal 

SKAl-MID (Band 1) 

4.6399 

0.8577 

13.402 

0.8675* 

0.4767* 

7.40 X 10“* 

SKAl-MID (Band 2) 

5.3662 

1.2611 

13.979 

0.7038* 

0.8616* 

3.39 X lO*^ 

MeerKAT (Band 1) 

4.9849 

1.8010 

17.359 

1.0000 

0.0000 

6.27 X 10^ 

MeerKAT (Band 2) 

7.6813 

4.5740 

28.067 

1.0000 

0.0000 

1.22 X 10^ 

SKAl-MID -F MeerKAT (Band 1) 

5.1667 

1.1585 

14.170 

0.8116* 

0.7163* 

7.76 X 10^ 

SKAl-MID -F MeerKAT (Band 2) 

5.4381 

1.3315 

11.836 

0.6343 

0.9708 

4.96 X 10*^ 

SKAl-SUR (Band 1) 

5.1814 

1.1668 

14.208 

0.8108* 

0.7174* 

9.95 X 10^ 

SKAl-SUR (Band 2) 

5.9023 

1.5879 

16.149 

0.6232 

1.0636 

4.25 X 10'’ 

ASKAP 

4.8577 

1.1551 

17.499 

1.0000 

0.0000 

8.08 X 10^ 

SKAl-SUR- fASKAP 

5.8611 

1.4742 

15.344 

0.6206 

1.0559 

5.55 X 10'’ 

SKA2 

6.7767 

2.1757 

6.6874 

0.5887 

0.8130 

9.12 X 10* 


Table 4; Values of the best-fit number count/bias parameters (Eqs. 0 and |4.2| ) for individual telescopes, 
after scaling the flux rms with frequency and interpolating. Some of the resulting bias functions are not fit 
very well by Eq. 4.2; these are flagged. The total number of galaxies expected for each survey is also shown 
(with 5,000 deg^ except for SKA2). 
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computational resources. One must therefore accept a number of approximations, which simplify 
the problem while preserving the accuracy of the relevant physical processes. Recently there has 
been significant progress, sparked by the computational needs of the cosmological redshift surveys 
community, in finding alfernafive numerical mefhods, such as fhe producfion of fasler simulations 
of the matter distribution that are reliable on cosmological scales (e.g. Tassev et al. 2013; White 
et al. 2014). 

Even using these methods, it would still be difficult to reach the required mass resolution. 
However, a number of methods have been proposed to extend the mass range in a consistent way 
(Sousbie et al. 2008; de la Torre & Peacock 2013). Also, once a simulation has been performed 
using a particular cosmological model, it can be efficiently modified fo any alfernafive model in 
order fo sfudy fhe cosmological dependence of differenl observables (Mead & Peacock 2014). 
This kind of mefhods can also be used fo generafe nof jusf one, buf a large number of realisafions, 
which can be used, for insfance fo compute uncerfainfies in a rigorous way. 

Furfhermore, while fhe galaxy populafion confained in fhe SAX-Sky simulafion was created 
applying a semi-analyfical model fo fhe dark maffer disfribufion of fhe Millennium simulafion, if 
would be of inferesf (if only for comparison) fo use a prescripfion based on fundamenlal principles, 
using resulfs from hydrodynamical N-body simulations (Dave el al. 2013). 

6. Conclusions 

Simulafions calculating fhe HI confenf of galaxies will be crucial fo predicf fhe number of 
galaxies fhaf will be observed wifh fulure HI surveys. Bofh fhe number counfs and bias will be 
fundamenlal ingredienls in order fo forecasl fhe conslraining power for cosmology of lelescopes 
such as fhe SKA. In Ibis chapfer we analysed explicilly fhe expected flux sensifivifies for fhe dif- 
ferenf SKA sefups (bofh for phase 1 and 2) as well as fhe number counfs and bias. Calculations 
show similar noise rms for SKAl-SUR and SKAl-MID. Numbers were oblained using fhe SAX- 
simulalions and following a prescripfion for delecling fhe HI sources. The bias is shown fo have 
a value around 1 al z ~ 0.7, increasing wifh bofh redshifl and assumed flux cuf (dependence on 
redshifl is more prominenl for high flux culs). Assuming a 5-sigma cuf, a 5,000 deg^ survey wifh 
MID or SUR, should find around 5 x 10^ galaxies. This can already have inleresling applicalions 
for cosmology, buf only al low redshifls, z ~ 0.5. SKA2 should be able fo deled aboul 10^ galaxies 
over 30,000 deg^ wifh a 10-sigma cuf. This will make if fhe ullimale "cosmological machine". 
Furlher work is still required in order fo improve fhe numbers. In parlicular: i) source deleclion 
algorilhms should be compared using real sky and inslrumenl simulations fo address fhe efficiency 
of HI galaxy deleclion; ii) larger simulafions need fo be done in order fo accounl for cosmological 
scales and in parlicular provide a heller calculalion of fhe bias; iii) full hydrodynamical simulations 
should be used in order fo calibrate fhe larger simulafions and confirm fhe HI confenf evolulion of 
galaxies wifh redshifl. Neverfheless, fhe numbers oblained in Ibis chapfer should already give a 
reasonable forecasl of fhe expected measuremenls, in parlicular for fhe galaxy number counfs. 
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